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ABSTRACT 

To determine the most effective language training 
activities fcr trainable m€?iitally retarded (TMR) children, the 
variables of degree of Pirevious language training, IQ, Peabody 
language treatment program versus Distar language treatment program, 
pretest verses posttest, and sex were examined with 122 TMR Ss (7 to 
1^ years old). Results of the Peabody Picture Vocabulary Test, the 
Illincis Test of Psycholinguistic Abilities, and the Mechann Verbal 
language Development Scale indicated that Ss were significantly 
better aided by the Distar program than by the Peabody program, and 
that only the children previously exposed to language training showed 
any significant growth in language. A second study examined in depth 
the nature of previous language experience on Dista r. performance . 
(IM) 
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INTRODUCTION 



This research project was designed to investigate the efficacy of 
providing intensified language stimulation programs to the trainable 
mentally retarded. Since there is a frequently reported and often observable 
deficiency in the language skills of this particular population of 
children, it would appear to be of significance to determine what effect, 
If any» a conceriied program of language stimulation will have. The 
outcomes of this experiment have a direct bearing upon the caseload 
composition of speech and language clinicians in the public schools , 
have implications for the types of testing and assessment procedures 
utilized with the trainable mentally retarded, and may definitely 
influence the degree and manner in which language services are provided 
to these children. In effect, the findings of the study can suggest 
whether the trainable mentally retarded can benefit from intensified 
language stimulation, and the amount of language stimulation required 
to achieve a significant degree of improvement. 

Improvements in language were assessed on sensitive instruments and 
a sub-population of the jsubjects in this project provided information 
concerning specific amounts of improvement in language abilities as a 
result of therapy sessions. This information is of value in pi'oviding 
for the evidence of the effectiveness of intensified language programs 
with this type of children. Such information is of value to other 
researchers ..ad experimenters and allows for the development of further 
important research hypotheses. 

3 
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PAST RESEARCH 

Despite the fact that a large percentage of the trainable mentally retarded 
children are enrolled in special classes in the schools and in spite of the 
research evidence which has consistently reported such children to have speech 
and language problems (Bangs, 1961; Brandfon, ^1951; Daum, 1953; Donovan, 1957; 
Evcrhart, 1953; Gens, 1950, 1951; Goodwin, 1955; Gottsleben, 1955; Harrison, 
1958; Irvin, 1942; Karlin and Kennedy, 1936; Karlin and Strazzula, 1952; Kennedy, 
1930; Kolstoe, 1958; Lewald, 1932; Lubman, 1955; Lyle, 1960; Masket, 1958; Matthews 
1957; Meader, 1940; Sachs, 1955; Schief elbusch, 1963; Schlanger, 1953b, 1953c; 
Schlanger and Gottsleben, 1957; Schneider and Vallon, 1954; Sheridan, 1948; 

Sirkin and Lyons, 1941; Tarjan, et. al., 1961; Town, 1913; Wood, 1957; Wolfensberge 

I 
f 

et. al,, 1963) there exists a paucity of important research with regards to the 
efficc^y of speech and languagtf programs with the trainable mentally retarded. 

Among children in special! classes , Matthews (1957) estimated an incidence . 

■ i 

of speech problems of 79 perceht. Lubman (1955) studied subjects with IQs below 
50 and noted that 95 percent had speech defects. Jo'hnson et. al., (I960) reported 
an incidence of about 57 percent in a study of trainable mentally retarded children 
Wood (1957) noted about 21 percent of a sample studied at a speech and hearing 
center to have language deficiencies associated with mental* retardation. This 
does not, however, indicate any. estimate of the number of mentally retarded who 
have language problems. 

. The estimates of the incidence of language deficiencies among the mentally 
retarded varies from less than 10 percent to almost 80 percent. This varimce 
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Is prj.marily due to the differences In the groups studied and the definitions 
of what constitutes a language problem. 

One of the major theoretical questions is whether laclt of language develop- 
ment among mentally retarded children is an inevitable coAsequence of mental 
retardation or whether intensive training can improve th^ rate of.lariguage develop- 
ment. The studies of language training programs for the retarded arc few. 
Since 1955 therapy with the rnentally retarded has empjhasized more than articulatory 
proficiency; it has demonstrated the necessity for providing appropriate language 
development programs. / 

Schneider and Vallon (1954) emphasize the ne^cessity for therapy with the 
severely retarded and challenge the view of We^^, Kennedy, and Carr (.1947), who 
thought that therapy with the severely retarded was useless, as being too pessi- 
mistic. They state that the simple ability to express, the wants or needs, of one- 
self in a socially approved manner, aion/ with the ability to merely express 
one's wants or needs, is an undeniable/asset to the child intellectually, 
emotionally, and socially • / 

In 1955, Schneider and Vallo^^reported on a therapy program for trainable 
retarded children in a day scho/l class. The children were categorized into 
three groups: (1) Delayed language development, (2) Insufficient language develop-- 
ment, and (3) Disturbances/of articulation. Appropriate therapy activities 
were presented to each g'roup for one year. The resultant data revealed gains 
for all groups. Thcs/^ judgments were, however, subjective, and n.o control group 
had been used. 

Johnson and /{lapobianco (1957) studied a group of severely retarded children 
following a ye/r of. language training; they reported no significant improvement. 
This study v/as noteworthy as one of the first experimental assessments of a 
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language program for the retarded in which the results were contrtidictory to 
preceding reports* 

Kilstoe (1958) observed the effect of a language training program with^Ji 
small group of mongoloid children. On five subtests of the Illinois Language 
Scale, the experimental group gained significantly over the controls during a 
five and one-half month's period. Rittmanic (1958) set .up a pilot program in 
group oral language with institutionalized retardates. Despite the lack of 
statistical evidence, he claimed that the program was successful. 

Smith (1962) 'conducted a language program for sixteen educable retarded 
children; he assessed the progress by using the ITPA. The experimental group 
showed a 6.75 month gain in Language Age during a three-month's period; the 
. controls declined .4 months in Language Age. Smith did not attempt to remediate 
any 'st)ecif ic disabilities. Improvement was, however , noted on all the language 
abilities as measured by the ITPA. Blue (1963) supervised a language program 
for trainable retardates similar to the previously described program by Smith. 
The program was conducted for an eleven-week period and utilized the ITPA for 
pre- and post- measurement. The experimental group showed a Language Age gain 
of 5.67 months as compared to the control group's 3.67 months. The difference 
was not statistically significant. This is considered one of the more prominent 
studies on the efficacy of language therapy for trainable retardates. 

Blessing (1964) reported on an experimental program which was designed 
to improve the vocal encoding of mentally retarded children. After a period 
of three-month's training the ITPA was used to note progress. The results re- 
vealed only a tendency toward improvement by the experimental group. 

Harvey, Yep, and Scllin (1966) reported on a two-year program for trainable 
mentally retarded children. Their program. emphasized, the areas of: (1) self- 
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concept development, (2) Social competence, (3) Motor coordination, and (4) 
Language development. Th-.ir results indicated highly significant improvements 
In the four areas. All scores, with the exception of . social competence, declined 
over the sunnner of the first year. This was interpreted to mean: (1) that 
there are differences between home and school environments, and (2) it is essential 
to maintain minimal programs during the summer for these children. The second 
year revealed significant increases in all areas. They concluded that evaluation 
of programs should be allowed to occur over longer periods of time, particularly 
with individuals with ley IQs . 

Richardson (1967) describes, a language training program for retarded children 
at the University of Oklahoma Child Study Center. It indicates that early 
sensory-motor training, beginning at the pre-verbal experience level is of 
utmost importance to the language development of these children. Methods used 
in the program are related to research evidence on the development of language 

and thinking which indicates that: (1) Early exposure to a variety of looking 

\ 

\ 

and listening experiences is important in language development, (2) Primary 
learning requires perceptual and pre-verbal experiences, (3) There is a close 
relationship- between motor movements- and perceptual development, (4) Language 
development requires the development of both motor and perceptual patterns, 
(5) The major source of internal mediators is the orienting response, (6) Lin- 
guistic labels serve to mediate learning processes, and (7) Language development 
is both a part of and a result of primary learning. 

Jordan (1967) reports that speech therapy outcome studies with the mentally 
retarded reveal that special psycholinguistic instruction can significantly 
increase psycholinguistic attainment. He suggests that programmed learning 
and operant conditioning be utilized to teach language to the mentally retarded. 

i 
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PoCCer and Mattson (1968) also indicate that the educable mentally retarded 
are capable of manifesting and sustaining Improvement In speech and language 
performance after therapy. Ensminger and Smith (1965) statfe, "knowing, that 
specific language skills can be improved and that retardates display a rather 
distinctive profile of their own, group language programs should be developed 
with this pattern of abilities and disabilities as the focal point." (p. 104). 

Early attempts at therapy for language disabilities were reported with 
Optimism, but were not objectively evaluated. Encouraging progress has been 
reported with the educable retarded; the trainable child, however, presents 
Some difficulty. Since many of the children involved in these studies were 
Institutionalized and since the size of the group was limited, it becomes 
difficult to generalize from these findings to the population of trainable 
mentally retarded children who are enrolled in special classes in the public 
schools. 

A factor of possible significance which may s^rve- to influence the result 
of research concerning the effectiveness of language stimulation for trainable 
mentally retarded children may be the amount of t: J^atment which is pro* Lded. 
There is a lack of research information indicating, for example, how many peri 
of language training are necessary during the period of a year in order for 
such children to achieve significant improvemenLs in language- 
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TREATOKNTS 

First Year : The set of 8 classes which received no special language train- 
ing went through their typical daily routines. However, the 8 classes which 
received language stimulation 4 times a week and the 8 classes which received 
the stimulation 8 times a week, were given specially structured lessons. The 
lessons were based primarily on the tasks embedded among the twelve subrasts of 
the Illinois Test of Psycholinguistic Abilities (ITPA), There were 8 lessons 
for each subtest, or a total of 96 lessons. Each lesson lasted about 25 minutes. 
The training was spread out from the start of November to about the middle of 
May. Four experienced speech clinicians carried out the program. An important 
point Is that the 8 lessons associated with any given ITPA subtest were not 
sequenced one after the other; rather, the set of lessons were distributed over 
the course of the total program so that the children would have more of a chance 
to retain and put into practice whatever benefits they had received. 

Second Year : Each child received language training stimulation 4 times a 
week. A total of 96 different lessons were available for either the Distar 
Language Program (Level 1) or the Peabody Language Devielopment Kit (Levels P and 
1). A day's session in either program lasted 25 minutes. Both language train- 
ing programs lasted from the beginning of November to afeou.t the middle of May. 
Certificated speech clinicians carried out the program. 

Third Year : During the third year of the project, each TMR child was 
assigned to one of three program offerings. The assignments were made not on a 
random basis, but instead upon the basis of what degree of language facility each 
child had. In this manner, the treatments were assigned in a rational, real-life 
fashion. It must be remembered that the primary goal was the appropriate education 
of the children, not basic research. However, in spite of what might be termed 
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various types of self-selection biases at work (related to language facility, 
which might be termed a developmental/organlsmic variable, as opposed to sex, 
which would be the classical type of organismic variable) » the reader will see 
later in the design itself that the final, data actually analyzed arose from 
representative, randomly selected children. 

The three treatments consisted of the Peabody Program (Level, P) , the Distar 
Program, or a- combination of the two. Those children who .were essentially non^ 
verbal received the Peabody Level P offering. Those children who had some language 
facility (knowledge of some words and phrases) were given the combination of 
Peabody Level P and Distar. Finally, if a child'' could use at/least short sentences, 
he was given the regular Distar program. 

The third year was devoted to implementing the Peabody Language Development 

/ 

Kit program (Level P) with the young, 'non-verbal trainable mentally retarded 
children, using this same program with a more s^uctured format as the children 
gain verbal ability at the word level. The/Distar program will be used for those 
children who have attained phrase level or higher. In addition to these programs, 
specific lessons which were intended to facilitate and generalize verbal expression 
were also carried out. This continuum of language programs was implemented in a 
small group by the classroom teacher. During the third year, specific training 
was provided to each of the classroom teachers involved in the project. This 
training was carried out during the regular school year. 

SAKPLE 

First Year : The first year's sample consisted of 157 children located in 24 
classes for the trainable mentally retarded. The children were between 7 and 14 
years of age and possessed IQs betveen 25 and -50. From this population of 157 
children, 120 were randomly selected. That is, 10 children were randomly selected 
from the 12 research design cell combinations formed by the factors of treatments 
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(3 levels) by IQ (2 levels) by sex (2 levels). 

Second Year ; The second year's population consisted of two groups: contin- 
uees (those who were in the first year of the study) and new entries (those who 
were brought into the study only during the second year of the project). In par- 
ticular, there were 116 cohtinuees (out of the origintil 157) and llA new entries. 

Third Year : Students who were in the Intermediate Unit's TMR classes for 
the trainable mentally retarded were involved in this project. During the first 
and second years 6f the project, students were involved in varying ways with ITPA 
training, Peabody training, and Distar training. As with any large public school 
operation, the Intermediate Unit's TMR classes during the third year had some 
children who were continued from the first project year, some who had entered 
/during the second year, and continued into the third year, and still others who 
were new entries for the third year itself. Further, not all children from the 
Intermediate Unit's classes were involved in any given year of the project in the 
activities in question for that year, ThuJ, over the three years of the project, 
one has several possibilities for a student's involvement. Table 1 summarizes 
the possibilities of a student's continuation status in third-year activities. 
The variable of continuation status becomes crucial in the actual design of this 
third-year study, as the reader will see later. 

Before any purging of data occurred due to continuation status or missing 
data, there was a total population of 233 children. 

For reasons that should be clear to the reader, the investigators decided to 
eliminate any child with a continuation status of 2, 3, A, or 5. Apart from the 
data arising from such continuation statuses being of doubtful "cleanliness,*' the 
numbers of children in these categories did not warrant further exploration. 

Finally, another area of elimination from the formal concerns of this report 
were those children who received the combined Peabody/Distar programs, 

11 
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For the Peabody-versus-Distar comparison, 5 student^ were randomly selected 
from each design cell to yield a total of 40 children. For the Distar intensive 
analyses, 4 children were randomly selected from each design cell to yield a 
total of A8 children. 
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INSTRUMENTATION 

First and Second Years ; The fitst yearns operation of the Language Training 
Project yielded very limited treatment effects as gauged by the standardized 
tests used. Several staff members felt that this poor showing was due, to some 
extent, to the fact that the tests in question (which are among the best recog- 
nized instruments currentJ.y available) do not adequately tap the language function- 
ing of interest to the study. The specific low level of language functioning given 
by trainable retarded children may require instrumentation not currently available. 

During the first year of -project operation, one very involved form of testing 
was that of Myklebust's Picture Story Language Test, as modified for this scudy 
(see Leiss, 1974). Myklebust (1965) used an action-packed picture to elicit 
samples of a student *s written language. In contrast, the second year used an 
adaptation to the extent that a student *s language was elicited in oral rather 
than written form; these oral language samples were tape recorded to preserve 
them exactly for later scoring. During the first year of the project, three 
pictures were used. Each picture was measured for "Productivity" by -means of 
three Criteria: total words, total sentences,, and words per sentence.. Further, 
each of the three pictures was evaluated for "Meaning /Contetit" by means of 
Myiclebust^s "Abstract-Concrete Scale." 

Because of the meager testing results and because of the large amount of 
work involved in deriving the total of four different scores for each of the three 
pictures, the modified Myklebust Picture Story Language Test (dubbed* "Language 
Sample" for this study) was largely omitted from the second project year design. 
One notable exception was to give the L'-^.nguage Sample to students who had continued 
from the first project year into the second project year. The main reason for this 
exception was to assess the longitudinal summei-lag forgetting phenomenon in train- 
able retarded' children. To project staff knowledge, such data have never before 
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been reported in the literature. Thus, no post-tiesting was given at the end of 
the 1973-1974 year in terms of the Myklebust Language Sample. It was felt the 
saving in time was more than justified. 

With the above reduction in total individual test administration time 
required for each child, the second project year opted to maintain a minimal 
battery of pr.e- and posttesting. Three instruments would be given as the measure- 
ment core: Peabody Picture Vocabulary Tes^, Illinois Test of Psycholinguistic 
Ability, and the Mecham Verbal Language- Development Scale. 

Third Year : A variety of measurement approaches were used in this study to 
tap as- many different language-related skills as possible. A series of standard- 
ized and/or commonly used devices were employed. Also, a type of criterion-refer- 
enced measurement (CR^-I) was embodied in the intensive study of the Distar program. 

With regard to standard measurement, technology, several approaches were used. 
First, selected subtests of the Illinois Test of Psycholinguistic Abilities (ITPA: 
Revised) were given to all children. In particular, the Auditory Association, 
Visual Association, Verbal Expression, and Grammatic Closure subtests were selected 
(scaled scores only). Second, the Peabody Picture Vocabulary Test (Form B) was 
given (mental age). Third, the Spencer Memory for Sentences Test was used. Apart 
•from these routinely used devices, the Distar Reading Program Placement Test was 
also give'n to all children. The Placement Test is divided into Parts A, B, C, D, 
and E; in addition, a Total score is' also yielded. 

The second major phase of the third year concerned^ only the Distar program 
itself. It was felt that an intensive investigation of the Distar program would 
reveal interesting facts about how language develops in TMR children subjected to 
a highly structured, commercially available program. The school year was divided 
into seven periods. At the end of each period, the lesson number in the books 
which the children were currently using was recorded and used as the dat-;:. input- 

n 
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Forjeach period, there were five possibilities: Book A, Book B, Book C, Story 

i 

Boojcj, and Coloring Book. Various schemes of reporting in-process Distar .progres: 
were explored prior to settling upon this final choice, but enormous problems arose 
; in selecting a uniform, meaningful method of scaling/ The lesson numbers seemed 
to be the most ititerpretable system of recording progress. In this sense, thie 
data system is akin to a criteripn-relorenced measurement (CRM) system in which 
only absolute levels of perf '^rraance are reported with no relative comparisons 
being made. 

Finally, the Distar Placement Test needs some further elaboration. The Test 
(located in Book A, Part I, of Distat Language I: An Instructional System, by 
S. Engelmann, J. Osborn, and T. Engelmann, Chicago: Science Research Associates, 
1972) .consists of Segments A through E. ' Segment A contains 5 tasks, each of which 
deals with "Action a;id Identity Statements." For instance, a child would be asked 
to poir. t to an ani'nal and then as^ed what the anfmal was doing. The respective 
"number of items for each Task is as follows: Task 1, "2; Task 1^ 2; Task 3, 5; 
Task A, 2; Task 5, a. Segment B deals with "Can/Is Action Statements." The child 
merely has to answer ''y^s" or '*no". to such items. There are 6 such items. Segment 
C dealB with "Polars." For instance, the child is asked whether something is long 
or short and merely has to answer "yes" or "no". There are 5 such items.' Segment, 
D deals wir h "Polars" also, but at a highet conceptual level . The child is shown 
a big object, for instance, but LPen .-^.s asked to tell about the opposite concept, 
a similar object. There are A items. Segment E deals with "Prepositions." There 
are 7 items. 

I 5 
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DESIGN AND ANALYSES 

' First Year ; Pre-testing ) 
and post-testing was carried out with the Peabody Picture Vocabulary Test 
(PPVT) and the twelve subtests of the Illinois Test of Psycholinguis tic 
Abilities (ITPA). Further, it was later decided to add the Mechara Verbal 
Language Development Scale (VLDS) to obtain still another outside criterion 
of language development. The testing was carried out at both the start 
and conclusion of the lengthy training period (November to May). 

Eye-y attempt was made to ensure rhat the three groups of classes 
were comparable at the start of the stud-. Because the children had 
to be kept in their original classes due to administrative and logistical 
reasons, randomization could be used only at the class level. Thus, the 
24 classes were randomly distributed among the three group settings: 
(a) no stimulation, (b) four tifeies a week, or (c) eight times a week. 
Further, initial comparability of the three sets of classes was achieved 
by analyzing pre-test differences on the PPVT, ITPA, and VLDS. Besides 
using the total raw scores from each of. the three criteria, the twelve 
subtest raw scores ^from the ITPA were also analyzed. The BMD02V computer- 
program for analysis of variance for factorial design (version of July 22, 
1965) frbra University of California at Los Angeles was used. Each analysis 
of pre-test differences embodied three factors: (a) treatments, (b) IQ, 
ana (c) sex. However, because IQ and sex differences were not of immediate 
Interest for establishing initial equivalence of groups, only the factor 
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of treatments will be considered here. No significant pre-test differences 
with regard to treatments were found on^ any ot the 15 analyses. Thus, 
for all intents and purposes of the evaluation design, .the three sets 
of groups can be considered initially equivalent. (It should be noted 
that unequal cell frequencies were present in the original three-factor 
data matrix involving 157 children. Several chronic absentee children 
were among the 157 children. After the decision was made to remove these 
absentees from the initial data matrix, the new unequal cell matrix 
comprised 1A8 children. To achieve final equal cell frequencies, a 
cell size cf ten was decided upon and children were randomly deleted 
from the appropriate cells. The resulting matrix, also used in later 
analyses, contained 120 children.) 

^ Once initial equivalence of the three sets of - groups was ascertained, 
a formal program evaluation design was selected. In particular, besides 
the three factors of treatments, IQ, and sex, a fourth factor of measures 
(pre-test versus post-test) was added. The resulting four-factor design 
was of repeated-measures type. As with the 15 pre-test analyses, 15 
analyses were run in the repeated-measures framework. The computer program 
use4. was BMDQ8V of the UCLA Biomedical package (yersion of September 1, 1965) 

The reader^ should note that in every one of the 15 analyses, a .-nixed 
effects model was derived. That is^, the factors of treatments and sex 
were considered fixed, but the factor of IQ (high and low, as determined 
by an approximate median split) was taken to be random. (Of course, 
replications or subjects were considered random in the data matrix therein 
120 children were left after removing unequal cell frequencies.) 

Apart from the four-factor, repeated-measures design used in the 15 
'gain analyses, descriptive analyses were also undertaken of the variable 
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of socioeconomic status (SES) . While' IQ has remained the main control 
variable of interest used in the above-mentioned 15 gain analyses, SES 
was also of interest. SES could not be included as a tifth factor in 
the design for the above, 15 analyses because the distribution of 
frequencies among factors was too uneven. Thus, it was decided to 
analyze separately in a descriptive way the effect of SES on the three 
treatments. The SES measure was the Minnesota Scale for Paternal 
Occupations; categories I to IV v:ere considered High SES, while V to VII 
were Low SES, 
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Second Year ; The primary concern of this study was the treatment comparison 

between the Peabody program and the Distar program. Wherever administratively 

possible, the classes containing both continuees and new entries were randomly 

assigned evenly between the two treatment conditions. Because of the potency 

of the IQ factor as a control variable, the second factor included in the design 

was IQ. A median split was employed so that low IQ represented 21 to A3, while 

high .IQ was AA to 53. The third factor was sex (males versus females). The 

fourth factor was measures (pretest versus posttest). Thus, the basic design 

for several analyses was a four-factor , repeated-measures design: treatments 

by IQ by sex by measures. i 

Besides the four-factor design mentioned above, a fifth factor was embodied . 
for certain analyses, namely, entry status. This factor had two levels: new 
entry versus continuee. Thus, the few analyses that included this fifth factor 
were of a five-facotr, repeated-measures design: treatments, IQ, sex, entry 
status, and measures. • ' 

One series of analyses dealt with the three criteria of the 
PPVT, ITPA, and VLDS . The pretest'and posttest data from the 197'3-197A year 
were placed within the four-factor design mentioned above. iPor each of the 
criteria, two separate analyses were performed: one for continuees and one 
for new entries. However, before any. analyses were run, 7 children were ,ran-. 
domly selected from each of the independent-factor cells (treatments by IQ 
by sex). Thus, eath of. the analyses had 56 children drawn at random from cither 
the 116 continuees or the llA new entries. A total of 6 such analyses were run. 

A second set- of analyses built in as u factor the comparison of continuee 
versus new entry. Each of these analy;scs was again done on 1973-197A data of 
prctest-posttcst type for the PPVT, ITPA, and VLDS. A total of 3 such analyses 
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were run. 

A* third set of analyses used only the data from the 56 continuees- These 
analyses represented longitudinal' studies . This set of analyses involved the 
,four pretest-posttest measures from both 1972-1973 and 1973-1974.. Three of these 
analyses were run: PPVT, ITPA, and VLDS. 

' A fourth set of longitudinal studies were run on the 56 continuees with 
regard to the Myklebust Language Sample data. The input consisted of the pre- 
test and posttest of 1972-1973 and the pretest of 1973-197A. Six such analyses 
were run: total words, total sentences, modified words per sentence, words per 
sentence as per Myklebust, abstractness-concreteness score, and average abstract- 
ness-concreteness score. 

In all analyses, the BMD08V program of the UCLA Biomedical series was used. 
The analyses were run on a CDC 6400 computer at Lehigh University. A mixed 
design was specified, with treatments, sex, and measures as fixo.d factors, while 
IQ and replications were random factors. 

. ■ . f 
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Third Year ; ... 

The two basic sets of analyses In the third year of this project deal with 
the Peabody versus Distar comparisons and with the Distar intensive comparisons. 

For the Peabody versus Distar phase, there were five factors: treatments (T) , 
sex (S) , continuation status (C) , measures (M) , and replicates (R) . Replicates 
was taken as a random factor, while the other four factors were interpreted as fixed. 
To achieve equal cell frequencies in this design, five children were randomly sampled 
from each design cell. Thus, while treatments were not originally assigned in a 
.random way (but rather . in the , real-life manner of what treatment would be most 
appropriate with a -given child), the final design involved- random sampling of children. 
Replicates were nested under the factors of treatments, sex, and continuation status. 
With the exception of replicates (which had five levels), all factors had two levels. 
While the factor of IQ might have been included, the design already was "saturated'* 
with factors that did not allow any further stratifying of subjects if a reasonable 
number of children yeve to b*e kept in each cell. In terms of any differential bias 
that might exist in favor of a ^treatment, the pre-level of the test in question, as 
well as the previous experience of the children as indicated by continuation status, 
were deemed more crucial factors than IQ itself. . < - 

For. the Distar intensive phase, three different sets of results were generated: 
(a) pretest-posttest results on the same criteria as in the Peabody versus Distar 
phase,^ but this time replacing treatments fey IQ to obtain additional information 
on the Distar Program itself; (b) intra-year findings for each of the 7 periods 
(separately) of ' the school year that the project used for CRM data recording; and (c) 
intra-year data across all periods. In these regards, all three sets of Intensive 
Distar analyses were loosely looked upon as "longitudinal" in nature. For (a), 
five factors were embodied: sex, IQ, continuation status, replicates, and measures. 
IQ was determined on an approximate, median-splitf basis; which in this study happened 
to be between 44 and 45; thus, the factor of IQ was random r-ather than fixed (that is, 
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it was empirically determined rather than logically/arbitrarily predetermined). 
Replicates was also taken at random. Sex, continuation status, and measures were 
all interpreted as fixed. All factors had two levels, with the exception of con- 
tinuation status, which had three (E.S.I, C.S.6, and C.S.7), and replicates, which 
had A level-s (A randomly selected children from each categorical cell). For (b) , 
measures as a factor was omitted. For (c) , the few criteria that had complete 
data across all seven periods of the year employed the same kind of, and number 
of levels of, factors as in design (a) with the exception of measures, which now 
had seven levels. 

From the above '^Design" discussion, the reader should be able to visualize 
the nature of the three different designs used in this study: (a) Peabody versus 
Distar, (b) Distar Intensive Analyses (Single Criteria), and (c) Distar Intensive 
Analyses (Multiple Criteria) . 

In all analyses, the BMD08V program of the UCLA Viomedical series was used. 
The analyses were run on a* CDC 6A00 computer at Lehigh University, Bethlehem, 
Pennsylvania. . ^ 

So that the reader understand in what manner the statistical tests of signi- 
ficance were carried out, it is necessary to describe briefly the error terms. 
In this discussion, several abbreviations will be used: T (treatments), S (sex), 
C (continuation status), M (Measures), and R (replicates). - For design (a), the 
Peabody versus Distar phase of the study, the variance source to be tested for 
significance and its appropriate error term, are given respectively in pairs as 
follows: Mean, R (TSC); T, R (TSC) ; S, R (TSC) ; C, R (TSC) ; M, RM (TSC) ; TS, R 
(TSC); TC, R (TSC); SC, R (TSC); TM, RM (TSC); SM, RM (T§C) ; CM, RM (TSC) TSC, R 
(TSC); TSM, R>I (TSC); TCM, RM (TSC); TSCM, RM (TSC) ; and RM (TSC), not tested: 
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For design (b), the Distar intensive analyses of single criteria, the 
variance source to be tested for significance and its appropriate error term, are 
given respectively in pairs as follows (Q denotes IQ) Mean, Q; S, SQ; Q, R(SQC); 
C,^ QC; SQ, R(SQC); SC, SQC; QC, R(SQC)1 SQC, R (SQC) ; and R(SQC), not tested. 

For design (c), the Distar intensive analyses of multiple criteria, the 
variance source to be tested for significance and its appropriate error term, 
are given respectively in pairs as follows; Mean, Q; S, SQ; Q, R(SQC); C, QC; 
M, QM; SQ, RM(SQC>; CM, QCM; SQC, R(SQC); SQM, RM(SQC) ; SCM, SQCM; QCM, RM(SQC) ; 
R(SQC), not tested; SQCM, RM(SWC) ; and RM(SWC), not tested. 
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RESULTS AND DISCUSSION 

First Year ; 

The Language Training Project has many important implications for 
realistic school practice and for future research. The main variable 
of interest was the intensity of application of a single training approach 
based around the twelve subtest tasks of the ITPA. 

There are two findings that have a huge impact on realistic school 
functioning. First, there were no significant differences among levels 
of intensity (the "treatments") in any of the 15 analyses. Second, only 
two out of the 15 analyses yielded any significant changes (one was a 
gain and the other was a loss); in general, there appeared to be little 
improvemeut of the children. From these two results, there seems to be 
only one conclusion possible: specific, prolonged language training based 
upon the ITPA is ineffective no matter what the intensity of application 
is. However, it should be noted that this does not mean that other 
types of language training with the trainable mentally retarded would be 
similarly ineffective. Nonetheless, this general conclusion must be temper 
by the presence of some significant interactions. 

The treatment-by-lQ interactions which occurred (seven Significant 
ones out of 15 total) showed two situations. In three of the, seven 
interactions, the logically expected superiority of the High-IQ group over 
the Low-IQ group dfd not materialize for the f our-times-a-week group. In 
the other four interactions, the expected superiority situation did not 
occur for either the f our-times-a-week group or the eight-times-a-week 
gr-oup. From these findings and Insepctions of tables not included In this 
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because of their excessive detail, it appears that the specific language 
training actually ^^Impeded the High~IQ groups. 

In suiDinary, this research report of the first year's results showed 
that specific language training based upon the ITPA has no effect on 
trainable mentally retarcied children and in fact seriously hinders the 
upper level IQ group in this population.. 
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Secord Year ; The basic evaluative emphasis during the second year of the 
project was on the global pretest-posttest assessments via the PPVT, ITPA, and 
VLDS. Because of the many univariate analyses performed in this annual project 
evaluation, some words of interpretative caution should be attached to the results 
Primary weight should be attached to the findings from the analyses that embody 
the most all-emcompassing comparisons among both continuees and new entries for 
the three primary criteria, and for the analyses that embody the most all-encom- 
passing comparisons among just continuees for the longitudinal (two-year) data for 
the three primary criteria. With these precautions as. a preface, the basic find- 
ings will be discussed. ' 

For the analyses containing both continuees and new entries, there are no 
generalizable treatment effects in favor of either Peabody or Distar. This is 
to be expected because human language behavior is so complex that one would hardly 
expect one program to be effective for all levels of disability or functioning 
within the TMR population. Thus, one looks to the interactions with treatments 
to provide the qualifications on lack of general findings that say in specific 
levels or TMR functioning, certain programs may nonetheless be effective. In the 
analysis of VLDS the treatment-by-IQ interaction was significant. Not only did 
''the Distar groups surpass the Peabody groups, but the low-IQ group did not lag so 
far .behind the high-IQ group with the Distar program as -they did in the Peabody 
program. In the analysis of PPVT, no two-way interactions with treatments were 
significant. In the analysis of ITPA, again no significant two-way interactions 
with treatments were found. ■ ^• 

In terms of gain during" just the second year, none of the three primary 
criteria showed significant movement for the analyses that included both con- 
tinuees and new entries. Further, none of the- interactions with gain were signi- 
ficant on VLDS. However, for PPVT, the IQ-by-gain interaction was significant. 
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Regardless of language program, low-IQ children actually lost over time, while 
high-IQ children gained over time. On, the ITPA, again no significant two-way 
interactions with gain occurred. 

Focusing just on the continuees from the first year of the project, one can 
detect some interesting trends. Here, longitudinal data was used' from both 
project years. On the ITPA, the main effect for treatments was not significant, 
nor were any of the two-way interactions with treatments. On the PPVT, a differ- 
ent picture emerged. The tre?tment-byr-IQ interaction (£<.05) showed that while 
no overall difference between Peabody and Distar existed, the low-IQ children in 
Peabody were greatly hindered in comparison to the other three rreatment-by-IQ 
combination groups. Also on the PPVT, the treatment-by-gain interaction (£<.05) 
showed the continuees had significantly higher performance in the Distar groups 
than in the Peabody groups, with the greatest gain occurring during the second 
year. On the VLDS, no general treatment effect occurred, but two interactions 
with treatments are worthy of discussion. The treatment-by-IQ interaction (£^< .05) 
showed that for continuees, the low-IQ Peabody group performed significantly worse 
than the high-IQ Peabody group, while the corresponding difference in the Distar 
groups was in the same direction but less pronounced. The treatment-by-sex inter- 
action was also highly significant (£<.01); in the Peabody groups, males were 
significantly lover s.han females, while the reverse was true in the Distar groups. 

The final reflections on analyses containing only the continuees deal with 
the gain phenomenon. On the ITPA, a significant (£< .01) gain occurred regardless 
of treatment. However, strangely enough, while the posttest of each year was 
highier than the corresponding pretest, the overall performance of the first year 
was higher than the second year. No overall change occurred in the analysis of . 
the PPVT; however, change did occur depending upon treatment group (a finding 
already discussed above). .^On the VLDS, a highly significant <£<.01) change over- 
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time occurred regardless of treatment* Here, there vas a notable gain 'during^ 
the second year of the project for the contin^^^^^' ^hiie their first year's per- 
formance was more or less static. Also on the VL^S, there was a significant 
(£^<.0l) IQ-by-measures interaction. The interaction vas caused mainly by the 
low-IQ students losing at a greater rate than the hig^^^jq students. The low-IQ 
students on an average lost twice as many points over summer as did the high-IQ 
students. Also, while the high-IQ students finally got back up at the end of the 
second year where they had been at the end of'^'^^ ^i^st year (liut got vio higher!) 
the low-IQ students did not even get up to the level they were at during the end 
of the first year. 

In summary, then, the above findings are those i^ ^hich perhaps the grea^test 
degree of confidence could be placed in lieu of actually having a multivariate 
analysis of variance design. From this brief precis of key fi^^dings, '^t now 
remains to put. a perspective on them. 

':What can one conclude from the primary set of Results? With regard to 
treatments, while no significant differences ^jmerged for^ the high-IQ children, 
the low-IQ children were aided moresso hy Dista^ than by Peabody. Further, the 
continuees showed greater gains during the sc;^ond year of the project in Distar 
than in Peabody. Of course, one must remember that these continuees during the 
first year were in various types of ITPA-based language st insulation programs. 
Thus, these children who continued on into the second year of the project (at 
which time Peabody. and Distar were introduced) benefit of earlier language 

stimulation, although the first year's project report indicated such ITPA-based 
training was of minimal value. (Children who had "Various degrees of ITPA-based 
training during the first year were, of course, randomly represented in each of 
the Peabody and Distar groups so that no differential pre-treatment contamination 
existed at the start of the second year), • ...... 
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In terms of change ovei' time, two observations are possible. First, because 
of the poor showing in analyses 7, 8, and 9, gain in the total sample was not 
marked (±.e., in those analyses where both continuees and new entries were con- 
sidered). However, when one considers only the continuees, sigrificant gain in 
language functioning did occur. Second, the summer lag phenomenon did occur for 
those TMR children who were continuees; that is, in considering the posttest from 
the first year and the pretest of tha seco^'td year, a marked dt crease in performanc 
occurred. 

The final set cf observations concern the measurement realm. It is clear 
that the battery of standardized tests used in both the first and second years 
of the project have not been specific enough to tap areas of language functioning 
of concern to this project. That is, the PPVT, ITPA, and VLDS are simply not 
valid 'enough reflections of the types of language training used with the TMR . 
children. The sensitivity of these instruments is extremely poor for detecting 
subtle changes in TMR children's performance.* Just what measurement dev^.ces 
might be substituted for the present ones is a question to which the present 
investigators cannot give a legitimate ainswer. It would seem desirable to con- 
sider impiemenuing a curriculUm-based, criterion^ref erenced measurement system. 
For example, if one is in the Distar program, then perhaps a recording system ' 
could be developed that would reflect developmental mastery changes of the chil- 
dren as they move throughout the various sequential units of Distar. In its 
crudest form, this CRM system might use only the sequential unit numbers at the 
end of every week or every two weeks for each child throughout the school year. 
One could make the CRM system a little more precise if he not only considered 
developmental unit numbers (which reflect an implicit mastery of the curricular 
continuum) but also ap/ended percentage. mastery scores on some criterion attached 
to each unit. '29 
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The last set of measurement considerations concern the analyses that were 
considered only subordinate in importance: longitudinal language sample data 
on continuees. With regard to the Myklebuist modified Language Sample procedure, 
a few general conclusions are possible. First, the procedure is time-consuming 
both to administer and to score 4 Second, when all., the various scores of the 
Language Sample are considered, only the Total Words and Total Sentences appeared 
to be sensitive to the types of language functioning of I>IR children. 

In summary, then, the second-year results appear to be more positive (mainly 
in favor of Distar over Peabody) than the first-year results in which different . 
intensities of ITPA-based language training yielded a very bleak picture. None- 
theless, even the! second-year findings are relatively ^mild and contain no stunning 
revelations. 

Third Year (Peabody Versus Distar) ; The reader must be cautioned on the 
restrictions^ that pertain to this design. In brief, these constraints involve 
the self-selection bias built into the study by default, \dLth the more language- 
advanced students receiving the Distar program. If there is a signific^^nt bias 
at work that contaminates treatment comparisons in any noticeable way, then this 
should be easily detectable by examining the interactions between treatments and 
measures; in particular, the pretest levels pitted againsc treatments should show 
the obvious differential biases at work. Out of a total of 12 analyses of Peabody- 
versus-Distar type, only two showed a significant treatments-by-measures inter- 
action. These two results involved the criteria of Parts D and E of the Distar 
Placement Test. On all 6ther criteria (Distar Placement Test: Total Score and 
Parts A through C; I'^PA: Grammatic Closure, Auditory Association, Verbal Express- 
ion; and Visual Association; Spencer Menjory for Sentences Test; and Peabody Pictur< 
Vocabulary Test) the treatments-by-measures interactions were insignificant* 
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Because of this situation, the investigators decided to simply omit any discussion 
of the contaminated Parts D and E of the Distar Placement Test and to proceed 
very cautiously with the results of the remaining criteria. 

Of primary interest to this , study was the main effect of treatments. Of the 

1 r 

10 "uncontaminated criteria," the treatment main effect was significant in all 
cases except on ITPA Grammatic, Closure and ITPA Visual Association. Of the 8 
significant treatment effects, the Distar Programwas clearly superior to the 
Peabody Program on all of the criteria except ITPA "Auditory Association, on which 
the opposite finding occurred. 

With regard to the other variables in the design, little was found except for 
measures. Sex yielded no differences in language activity except on ITPA Visual 
Association, on which boys were superior to girls. . No difference at all was found 
among the 10 uncontaminated criteria for -the variable of continuation status; 
apparently the previous, formal language experiences of the children had little 
effect.^ The variable of measures yielded significant main effects on each of the 
10 criteria except on the Spencer Memory for Sentence Test. Thus gains uniformly 
occurred, regardless of treatments. 

With regard to interactions on the 10 uncontaminated criteria, the only 
significant two-way result was for sex by continuation status on ITPA Visual 
Association; in particular, for C.S.5, girls performed significantly higher than 
boys, while for CS.l no difference occurred. 

Fortunately, triple interactions were kept to a minimum on the 10 criteria of 
"uncontaminated" type. Thus, discussion of results is greatly simplified. The 
only exceptions were treatments by continuation status by measures on ITPA Verbal 
Expression, treatments by sex by continuation status on ITPA Visual Association, 
treatments by sex by measures on Distar Placement Test (Part A), and treatments 
by continuation status by measures on Distar Placement Test (Part C) . 
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Third Year; Dlstar Intensive Analyses (Pretest-Posttest Findings) : Basically, 
this series of analyses was the logical continuation of those presented above and 
changed only In the sense of replacing the treatments factor with the IQ factor. 

The purpose of these analyses was ..to examine more Intensively any trends that 

f ■ 

might be at work within the Dlstar Program Itself (which was the main focu^ of 
interest of this third year of the project). 

As with ^he ,Distar/Peabody analyses above, the pretest-posttest analyses again 
had 12 criteria. In no case did any sex differences arise, with the exceptioti of 
a complex triple interaction among sex, IQ, and measures. On all but 4 of the 
criteria, IQ operated effectively in the expected direction to control some of 
the variation (the exceptions were ITPA Grammatic Clo53ure, ITPA Auditory Association 
ITPA Verbal Expression, and ITPA Visual Association), The latter finding again 
raises some validity problems on the ITPA, since an IQ difference would be expected 
on virtually all criteria. Continuation status produced no difference at al^l on 
any of the 12 criteria. Measures (pretest and posttest) was significant on only 
one criterion: Dlstar Placement Test (Part E) . 

Apart from the main ' effects , the only other results that might be taken note 
of were an IQ-by-measures interaction on four of the 12 criteria (Dlstar Placement 
Test: Total Score, Part A, and Part C; ITPA Visual Association). Two . triple inter- 
actions (sex by continuation status by measures, and sex by IQ by measures) were 
also significant but are too complex to go into for .brief summary p'urposes here. 
- Third Year: Dlstar Int e nsive Analyses (Intra-Year Findings Separately for 

Each of 7 Periods) : This series of analyses looked at the CRM data that was 
yielded in day-to-day contacts with the program. The lesson numbers of Dlstar 
Book A, Dlstar Book B, Dlstar Book C, Dlstar Story Book, and Dlstar Coloring Book, 
were monitored , for each child in the^ Dlstar Program at the end of each of 7 evenly 
spaced time periods throughout the school year. During certain periods one will 
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notice that not all 5 possible lesson numbers were involved in the analyses (e.g., 
•Period 1 nas only Book A and Book B) ; this situation is merely a reflection of the 
fact that developmentally, the children^ did not begin using certain of the more 
advanced components (Book C, Story Book, and Coloring Book) until greater facility 
in language skills had been achieved at the lower levels (Books A and B) . 

There were a total of 22 CRM measures (as discussed earlier in the "Insttuments 
section) processed during this portion of ^he Distar Intensive Study. Of the three 
factors in the design (sex, IQ, and continuation status), only IQ resulted in any 
noticeable pattern of significant differences. Of the 22 CRM measures, only 6 did 
not yield a significant IQ difference (specifically, Distar Book A, Period 1; Distar 
Book B, Period 1; Distar Book C, Period 4; Distar Story Book, Period 5; Distar 
Book C, Period 6; Distar Story Book, Period 6; and Distar Book C, Period '7) . There 
appears to be no real pattern to these nonsignificant differences, and little more 
need be said here. 

The variables of sex and continuation status were consistent in that neither 
produced any significant differences. Insp^tition of the cell .neans tables in 
Appendix C shows why this situation arose. 

Distar Intensive Analyses (Intra-Year Findings Across All 7 Periods) ; Basic- 
ally, the purpose was to look . at, those CRM Distar measures that provided data 
across all 7 periods of the school year. Only Books A and B yielded such data; 
this situation was a partial reflection of the develof^mental problems of the 
children and the resultant times at which they stat.red Distar Books A, B, or C, 
or the Story Book, or the Coloring Book. 

As one can see from the tables, sex and continuation status were insignificant 
on both Book A and Book B. The factor of.IQ worked effectively to isolate sonie of 
the error variance; the significant difference was in the expected direction on 
both measures. The factor of measures resulted in significant progress being 
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steadily shown throughout the school; in Book A, the lesson numbers ranged from 
33,12 in Period 1 to 77,3 in Period 1 , while in Rook B, the range was from ''i.42 
in Book A to 77.23 in Period 7. The only other significant result was the IQ-by~ 
measures interaction, on both criteria; no other intreractions were significant on 
either criterion. 
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TABLE 1 



CONTINUATION STATUS LEVELS 



LEVEL 



C.S. 1 
C.S. 2 
C.S. 3 

C.S. 4 
C.S. 5 
C.S. 6 

C.S. 7 



DESCRIPTION 



1974-1975 ( 1 year of contact ) 

case study control group ( 3 years of contact ) 

controls in 1972-1973, then regular program in 1973-1974 
and also 1974-1975 ( 2 years of contact ) 

in year 1, out year 2, in year 3 ( 2 years of contact ) 

in all 3 years, but case study in year 1 ( 3 years of contact ) 

in year 2 (1973-1974) and year 3 (1974-1975) ( 2 years of 
contact ) 



in all 3 years ( 3 years of contact ) 
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